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In quantum superdense coding, two parties previously sharing entanglement can communicate 
a two bit message by sending a single qubit. We study this feature in the broader framework of 
general probabilistic theories. We consider a particular class of theories in which the local state 
space of the communicating parties corresponds to Euclidean hyperballs of dimension n (the case 
n = 3 corresponds to the Bloch ball of quantum theory). We show that a single n-ball can encode 
at most one bit of information, independently of n. We introduce a bipartite extension of such 
theories for which there exist dense coding protocols such that log 2 (n + 1) bits are communicated 
if entanglement is previously shared by the communicating parties. For n > 3, these protocols are 
more powerful than the quantum one, because more than two bits are communicated by transmission 
of a system that locally encodes at most one bit. We call this phenomenon hyperdense coding. Our 
hyperdense coding protocols imply superadditive classical capacities: two entangled systems can 
encode log 2 (n + 1) > 2 bits, even though each system individually encodes at most one bit. In 
our examples, hyperdense coding and superadditivity of classical capacities come at the expense of 
violating tomographic locality or dynamical continuous reversibility. 


I. INTRODUCTION 

Classical information can be encoded in quantum sys¬ 
tems and reliably recovered. One of the founding results 
in quantum information science [I| is the Holevo theorem 
0. It implies that, fundamentally, the classical capacity 
of N qubits is N bits: N qubits can perfectly encode 
N classical bits, but no more. It follows that classical 
capacities are additive in quantum theory. Though this 
seems quite natural - it would be strange that by com¬ 
bining two systems that locally store one bit more than 
two bits could be encoded - there exist, due to quan¬ 
tum entanglement, quantum channels whose capacities 
to communicate classical Q or quantum information 
can be superadditive. 

Entanglement is also responsible for other counter¬ 
intuitive aspects of the communication properties of 
quantum systems. In particular, it is at the basis of su¬ 
perdense coding Q, one of the fundamental protocols of 
quantum information theory. Though one qubit can lo¬ 
cally encode at most one classical bit, in a superdense 
coding protocol a two bit message can, surprisingly, be 
communicated by the transmission of a single qubit with 
the aid of previously shared entanglement. 

Though the use of entanglement therefore provides an 
advantage over purely local protocols, it has a limited 
communication power because no more than two bits 
can be communicated by transmission of a qubit inde¬ 
pendently of the amount of entanglement that the com¬ 
municating parties share. The amount of quantum infor¬ 
mation that a transmitted quantum system can commu¬ 
nicate is fundamentally limited by its Hilbert space di¬ 
mension [^. This guarantees in particular that no more 
than 2N bits can be communicated in quantum super- 
dense coding using a pair of entangled systems whose 
individual classical capacity is N bits. A hypothetical 


protocol violating this quantum limit, which we denote a 
hyperdense coding protocol in the following, would imply 
a violation of the additivity property mentioned above. 

Our motivation in this paper is to understand better 
the physical principles underlying the limited power of 
quantum superdense coding and the additivity of classi¬ 
cal capacities. Could one conceive theories with hyper¬ 
dense coding and superadditive classical capacities? How 
would such theories differ from quantum theory? What 
physical principles would they violate that are obeyed by 
quantum theory? Can the additivity of classical capaci¬ 
ties and the inexistence of hyperdense coding be consid¬ 
ered as conditions for physically sensible theories? 

We address partially these questions in the framework 
of Generalised Probabilistic Theories ^PTs). These 
were introduced several decades ago and con¬ 

tain classical and quantum theory as special cases. This 
mathematical framework provides a basis for studying 
probabilistic theories that may have non-classical prop¬ 
erties, and for formulating “natural” physical axioms for 
quantum theory [l3 - [l^ . The informati on p roperties of 
GPTs have been extensively studied [20l - l32j |. 

We consider a particular class of GPTs in which the 
state space of the communicating systems corresponds to 
an Euclidean ball of arbitrary dimension n G N. These 
hypersphere theories, as we call them here, have been 
previously investigated [H, [H, [H, They have 

important physical motivations. The cases n = 1 and 
n = 3 correspond locally to the classical and the quan¬ 
tum bit, respectively. The local state space of a general¬ 
ized bit can be deduced to be an n—ball from physically 
motivated axioms M , which include a weak version of 
information causality [39j. 

We show that for arbitrary n, a single n—ball can en¬ 
code at most one bit of information. We then show that 
there exists a bipartite extension of hypersphere theories 
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that includes entangled states for which there exists a su- 
perdense coding protocol whose communication capacity 
is log 2 (n + 1) bits. Thus, for n > 3, this protocol is 
more powerful than the quantum one: it is hyperdense 
coding. Our hyperdense coding protocol imply that by 
using entangled states, two hypersphere systems can be 
used to encode log 2 (n + 1) bits, thus achieving superad¬ 
ditive classical capacities when n > 3. 

We then turn to the interpretation of these results. We 
find a connection between superadditivity of classical ca¬ 
pacities and hyperdense coding with two of the physical 
conditions imposed on GPTs in various derivations of fi¬ 
nite dimensional quantum th eory : continuous reversibil¬ 
ity and tomographic locality |l2l . [Bill. 

This paper is organised as follows. We briefly intro¬ 
duce in Section m the formalism of generalised proba¬ 
bilistic theories and in Section Hill the specific class of hy¬ 
persphere theories that we consider here. In Section HVl 
we investigate different communication scenarios in the 
context of these theories. We introduce a protocol for 
hyperdense coding, which implies the superadditivity of 
classical capacities, as well as for teleportation and en¬ 
tanglement swapping. We conclude with a discussion of 
the physical implications of our results. In particular, we 
present in the Appendices several additional results re¬ 
lating the (in)existence of hyperdense coding and super¬ 
additivity of classical capacities with physical properties 
such as tomographic locality and dynamical reversibility. 


II. GENERAL PROBABILISTIC THEORIES 
A. States and effects 

In general probabilistic theories (GPTs), the space of 
unnormalised states is a proper cone C C The 

space of unnormalised effects is the dual cone of C: C* = 
{e £ R"+^|e • w > 0, Vw G C}. The unit effect u is an 
interior point of C*. A measurement is a set of effects 
that sum to the unit: M = {a £ C*\Y^^ei = u}. An 
unnormalised state uj £ C has positive scalar product 
with the unit effect, u ■ io > 0, while a normalised state 
UJ £ C has unit scalar product with the unit effect, u-uj = 
1. Given normalised state uj and measurement M , the 
probability of outcome i is p(i\Muj) = Ci ■ uj, where 
denotes the Euclidean inner product. 

For any GPT, without loss of generality we can take 
the unit to have the form u = ( q ), where 0 is the null 
vector in R". The space of normalised states can then 
be written as fl = {ujr = (r)|?' £ C M"} with TZ a 
convex set. It is the convex hull of its extremal points, 
the pure states, which cannot be expressed as convex 
combinations of other states. The states in that are 
not pure are called mixed. 

The unnormalised states can be expressed as uj' = Xu! 
with A > 0 and uj £ VI being a normalised state. The zero 
state is the null vector 0 £ The space of physical 

states is often defined as the set of unnormalised states 


uj' such that u ■ ui' < I, which is the convex hull of the 
zero state and the set of normalised states il. An un¬ 
normalised state corresponds to a preparation in which a 
particular outcome is obtained with a probability smaller 
than I. By considering all the outcomes obtained in a 
state preparation, we can consider that a state is always 
prepared with unit probability, hence, we can restrict the 
space of physical states to the set of normalised states 
Here, we only consider normalised states H. 


The space of physically realisable effects will be noted 
£. Any measurement corresponds to a set of effects be¬ 
longing to E. We assume here that the set of all nor¬ 
malised effects is observable, that is,f = {eGC*|e-a;< 
1, Vw £ H}, which is convex. It can be argued that this 
does not need to hold in general, that E could as well 
be restricted to a proper subset of the normalised effects 


B. Bipartite systems 

Let El A C EIb C be the state spaces of 

the systems A and B, and EIab their joint state space. In 
general, EIab can be defined arbitrarily. Unless otherwise 
stated, in the rest of this work we assume that a state 
4> £ EIab defines the outcome probabilities for each pair 
of effects CA £ Ea and bb £ Eb, and that it satisfies two 
natural physical conditions: the no-signalling principle 
and tomographic locality [Biiaiiii. The no-signalling 
principle states that the outcome probabilities for any 
measurement performed on the system A are indepen¬ 
dent of what measurement is performed on the system B 
and vice versa. Tomographic locality states that all states 
4> £ EIab are characterized by the outcome probabilities 
of local measurements performed on A and B. 

It follows [B) mi from these conditions that EIab C 
0 and that El a ®min EIb C EIab Q 

El A ®max EIb, where El a Gimin EIb = convex hulljwA <8 
ujb\uja £ EIa,ujb £ fls} is the minimal tensor product 
and El A G)max EIb = {4>\{ua ® ub) • ^ = 1, (ca ® cb) ■ (jj > 
0 Vca £ Ea,cb £ Eb} is the maximal tensor product. 
Similar definitions can be given for the space of effects: 
Ea ®min Eb = convex hulljeA G) cbIca £ EatCb G Eb} 
and Ea ®max Eb = {E|0 < E ■ (uja ® ujb) < 1 Vwa G 
EIajCJb G Hb}. The minimal tensor product HA®minflB 
includes all the separable states but does not contain 
any entangled states. The maximal tensor product 
El A ®max El B Contains all states that are consistent with 
the no-signalling principle and that give valid probabili¬ 
ties to all local measurements. The states in ElA^maxElB 
that are not in Ha Gimin EIb are entangled. If A or H is 
classical then EIab = El a Gimin EIb = El a ®max EIb, while 
for quantum theory Ha Gimin EIb C EIab C Ha ®max EIb 

fi. 

We define the unit effect of Has as uab = ua®ub. 
It follows that all states (f) £ EIab can be expressed as 
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matrices 



effects are = ^{m) with m G M" and ||m|| = 1 , 
defining also a hypersphere. Each unit vector ||m|| = 1 
defines a canonical measurement: = {em,e_m}- 


where a € , b G R."® and C G R"^ 0 R"^ ■ Similarly, 

all effects E G £ab can be expressed as matrices 


A. Bipartite systems 


E = 



( 2 ) 


where 7 G R, a G R""^, /3 G R”® and E G R”^ ® R”^. 
The probability of obtaining outcome given by effect E 
if the state \s (p \s E ■ (p = Tr(E*0), where ‘t’ denotes 
transposition. The unit effect uab is a matrix m with 
7 = 1 and all other entries zero. The reduced states 
UJA = pUB = (q) and ujb = P^ua = (&) must satisfy 
uiA G flA and ujb G £Ib- Thus, it is required that a G TZa 
and b G TZb- 


C. Dynamics 

A GPT specifies the state spaces and £Iab 

for all physical systems A and B. It also specifies the 
set of allowed transformations on the state spaces. We 
adopt the following consistency condition as the minimal 
physical condition that the allowed transformations must 
satisfy [l^. The set of allowed transformations 7a on 
system A is consistent if for every Ta & Ta, have 
Ta : IIa —t f^A and Ta® Ib '■ ^^ab —t ^^ab, where ^Iab 
is the joint state space of A and any system B, and Ib 
is the identity map on B. 

In general, the allowed transformations can be repre¬ 
sented as linear maps [^ . Thus, we express any trans¬ 
formation Ta G 7a as a matrix in R"^+^ (8)R"^+^. 

We notice that according to the conditions given above, 
the allowed transformations must be normalisation¬ 
preserving, that is, normalised states are transformed 
into normalised states. It is often considered that trans¬ 
formations T can output an unnormalised state uj' = 
Tuj, with w G n such that 0 < u • w' < 1. These 
are normalisation-decreasing transformations and corre¬ 
spond to obtaining an outcome with probability less than 
1. However, by considering all the possible outcomes, the 
transformations reduce to normalisation-preserving ones. 
Here, we only consider normalisation-preserving transfor¬ 
mations, as given by the consistency condition. 


III. HYPERSPHERE THEORIES 

We introduce in this section a family of GPTs that 
we call hypersphere theories. These theories have been 
studied before [H, [H, [l^, The state space of 

single systems in hypersphere theories (HSTs) is defined 
as the unit ball of dimension n, = {ojr = (r)k ^ 
77 } with 77 = {r G R”|||r|| < l}, for which the set of 
pure states ||r|| = I defines a hypersphere. The extremal 


We now introduce a particular extension of HSTs to 
two systems that, as we show in the next section, has hy- 
perdense coding, superadditive classical capacities, tele¬ 
portation and entanglement swapping. We leave it as an 
open question to investigate other possible extensions of 
HSTs, in bipartite and multipartite settings, and their 
corresponding communication properties. 

Before doing so, we introduce a useful algebraic 
structure (based on Hadamard transformations). Let 
p, = {po,pi,...,pN-i) G {0,1}^ and u = 

(no, ui,. .., G {0,1}^ be N bit strings. We define 

2^ vectors G R^ , labelled by p, whose components, 
labelled by n, are 

(d^). = (-1)'^-, (3) 

where p ■ v = ©^Q^^/r'z, with © denoting sum mod¬ 
ulo 2 (we note that (d^)^ is the p, v entry of a real 
Hadamard matrix). We define this set of vectors as 
'E>n = {d/ilMelo.ij'v. 

It is easy to see that (do)i/ = (d^)o = 1 for all 
p,v ^ {0,1}'^, where 0 = (0, 0,..., 0) is the string with 
N zero entries. Additionally, for p 0, we have that 
dfj, has 2-^“^ entries equal to —1, and the other 2^^“^ 
entries equal to +1. To see this, suppose that p has ex¬ 
actly k entries equal to 1, with I < A: < TV, in positions 
Thus, we have pp = pi^ = ■ ■ ■ = pi^ = 1, 
with all other entries equal to zero. It follows from ([3]) 
that = — 1 iff = 1, where the sum is modulo 

two, which holds for exactly half of the possible values of 
n, that is, for 2^“^ cases. Thus, d^ has an even number 
of —I entries for N > 2 and p 0- 

Furthermore, it follows easily from ([3]) that Vn is 
closed under the element-wise product: 

dfj. o dfj,i = G 77jv, (4) 

for all p,p' G {0,1}^. That is, (d^ o d^/)i, = 

(dy,)„idfj,p,, = (d^e^Oi^ for all p,p',n G {0,1}^, where 
p®p' = (^0 ©Mo, Ml ©Ml,.. ■,Pn-i® p'n-i)- Thus Vn 
is a group under the element-wise product, with do be¬ 
ing the identity element. Moreover, the elements of T>pf 
satisfy: 

^ (d^), = 2'^d,,o, (5) 

df,-df,j =^{dy,),,(d^Py = 2^5f,^f,j. (6) 

V 

Let us now define the set SIab of bipartite states for 
systems A and B that are locally described by HST sys¬ 
tems of dimension n = 2^ — 1. We take 

Has = convex hull {Ha ©min ^^BjSn} 
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to be the convex hull of the product states and the dis¬ 
crete set of entangled states Sn = defined 

as the set of diagonal matrices 

= diag(d^), (7) 

whose entries are given by {4)^)vy = where 

v.i'' G {0, l}-^. The corresponding reduced states for 
systems A and B are the completely “mixed” states cua = 
= (o) S riA and lob = = (q) € ^ b , with 0 

being the null vector in K.^ 

We introduce a discrete set of entangled effects J^n = 
{E^L}f,e{o,i}^, through 

= 2-^ ct>^. ( 8 ) 

This set defines a measurement 

M = {£'/i}M6{0.1}"; (9) 

because 

Ef,= diag(l, 0) = UAB, ( 10 ) 

as follows from JS]), d?]) and (|S]). The probability to get 

outcome /i when measuring the entangled state 4>^i is 

=2-^df,-df,> ( 11 ) 

as can be deduced from © - ®. 

Finally, we define the allowed local transformations as 
the convex hull of the discrete set 7n = 
where 

( 12 ) 

Note that To = / is the identity in . Furthermore, 
from ®, © and (EH), we have 

Tfx'Tf^ = G Tn, (13) 

for all /r' G {0,1}^. Thus, we have that T/v is a group 
under matrix multiplication. Since the vectors d^ have 
only ±1 entries with an even number of —1 entries for 
N > 2, it follows from © and EH that Tn is a subgroup 
of SO(2^) for N>2. 

Since (T^)o.o = (d/i)o ~ 1 d ^ {0,1}^> we can 

express the matrices as 

It follows that Tn = {’i)i}Aie{o,i}'^ is a subgroup of 
SO(2^ - 1) for N>2, and Ti = 0(1). 


B. Consistency of the above definitions 

We show that the above definitions make sense in the 
GPT framework considered here, i.e., that the set ^ab of 
bipartite states is contained in the maximal tensor prod¬ 
uct ^Ia (8imax 11 b, that the set of effects En is contained 
in the space of normalized effects £ab of £Iab, and that 
the local transformations Tn are consistent in the sense 
that T^ : ft a —t ft a and (T^ 0 /b) : Hab —t ftAB for 
all T^ G Tn- Additionally, we show that tomographic 
locality is satisfied. 

Let us first show that Tn satisfies the consistency 
condition. Consider Wq G ft a- From EH, we have 
T^Ua = (), where = f^a. Since G 0(2^-l) and 
Wa G ft A, we have ||a;_j|| = ||a|| < 1. Thus, T^Wq G ftA- 
Consider now (j) G ftAB- If <)> G ftA Cimin Hb, it fol¬ 
lows straightforwardly that T^cj) G ftAB- If 4> is one 
of the (p/j,' G Sn, we have from EH and EH that 
G Sn G ftAB for any G Sn - 

Let us now show that Sn C ftA®maxftB- This implies 
from the definition of 11 ab that Ha C)min Hb C Hab C 
Ha <8)max ftB- Let pfj, G Sn and consider arbitrary ex¬ 
tremal effects Ba = G £a and ep = G £b, 

where a,j3 ^ and ||a|| = ||,d|| = 1. We have that 

(e„ ® 6 / 3 ) • • (/ 3 J = i(l-ka-/3^), (15) 

where (3^ = T^j3. Since Tf^ G 0(2^ — 1), we have ||/3;i|| = 
||/3|| = 1. Thus, 0 < {ea^ep)- (j)f^ < 5 < 1- Furthermore, 
from and © ,we have that (uAffiUB) Tfj, = ('^m)o = 1 , 
which proves the claim. 

In addition, tomographic locality is satisfied. It is 
straightforward to see that the entries {(Tvy of the states 
(j) G ftAB are determined by the local measurements 
Ma = and Mb = 

on systems A and B, respectively, where G is 

a vector whose :zth entry equals unity and whose other 
entries are zero. 

Finally, we verify that En C £ab- We need to show 
that, for any E^ G En, we have: i) 0 < E^ - (uJa<S) 0 Jb) < 1 
for arbitrary pure states Wq = (^) G Ha and ujb = (1) G 
ftB, where a,b G and ||a|| = || 6 || = 1; and ii) 

0 Efj_- (p^i < 1 for arbitrary entangled states G Sn- 

We show i). From ®, EH and EH, we have 

E^-{uJa®uJb) = 2~^(j)^-{u;a‘S>uJb) = 2 “^(l-|-a- 6 ^), (16) 

where bf_i = T^b. Since G 0(2^ — 1), we have ||5;_j|| = 
|| 6 || = 1. Thus, 0 < • (uja ® ^b) < < 1, as 

claimed. 

We show ii). This is implied by EH- 
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IV. COMMUNICATION IN GPTS AND 
HYPERSPHERE THEORIES 

A. Classical capacities 

Consider a situation where Alice sends Bob a classical 
message x with probability p(x) by encoding it in a GPT 
state LJx and where Bob decodes the message using a de¬ 
coding measurement M = {cy S Given 

that the message x was sent, Bob obtains the outcome y 
with probability p(?/|a;) = ey-ojx- The mutual information 

I{X : Y) = Y^^yP{xy)log 2 (^pixy)/{p{x)p{y))'^ quanti¬ 
fies the amount of classical information that is transmit¬ 
ted through such a protocol. 

Classical capacity of a GPT. The classical capacity 
Xc(^) of a GPT with state space 0 is the maximum of 
I(X : Y) over all probability distributions p{x), encoding 
states LOx € fl, and decoding measurements M. 

The classical capacity of d-dimensional classical theory 
(a classical dit) is log 2 d. Similarly, the classical capacity 
of a qudit is log 2 d, as follows from the Holevo bound 
0. The following proposition allows one to put upper 
bounds on the classical capacities of GPTs. It will prove 
useful below. 

Proposition 1. Consider a GPT with state space = 
[wr = (J.)|r’G7?.C M"} and unit effect u = (q). As¬ 
sume that any effect e & £ can be expresed as e = j , 
with 7 G [0,1] and m G Ad C K". Then the classical 
capacity of the GPT Cl is bounded by 

Xc{^) < max {log 2 (l -f TO • r)} < log 2 (l -f MR), 

r^'K,7n£A4 

where M = maxmGAt{||m||} and R = maxrg 7 ?,{||r||} . 

Proof. Let X be the random variable corresponding to 
messages x, chosen with probability p{x) = Px, encoded 
in states ujx = (r-i ) ? where G TZ. Let Y be the random 
variable of the measurement outcomes y corresponding to 
effects Cy = 7y ()) where 7 y G [0,1] and my G Ad. The 
condition Cy = u= ( q ) implies 

= = 0 - ( 17 ) 

y y 

The outcome probabilities are 

p{y\x)=ey-ujx="fy{l-^my-rx). (18) 

The condition p{y\x) > 0 implies that 

my -rx > -1, (19) 

for 7 y > 0. Notice that the case 7 y = 0 corresponds to 
probabilities p{y\x) = 0 which have null contributions to 
the Shannon entropies H(Y\X) and H{Y), and hence to 
the mutual information I{X : Y). Thus, without loss of 
generality we consider that 7 y > 0 , for which (fTOl) holds. 


From the dehnition of the mutual information I[X : 
Y) = H{Y) — H(Y\X) and (flT^ . it is straightforward to 
obtain the expression 

I(X :Y) = Li- L2, (20) 

where 

Li=y^Pxy^jy{l -l-TOy-ra:)log 2 (l -\-my-rx), (21) 

X y 

T2=XAy(l + my-f)log2(l-hTOy-f), (22) 

y 

and f = 

Let D = maxr,,g 7 ?,,m„GAt{log 2 (l + my ■ r^,)}. We have 
from (EH), the definition of D, and EH, that 

Li < E P=c E '7^(1 + '^v ’ 

X y 

y y 

= D, (23) 

Denote Zy = l-\-my-f. We have Zy > 0 and J^yly^v — 
1 with 7 y > 0 a probability distribution since J2y 7y = 1- 
Hence we have 


L2 ='^lyZyl0g2 Zy, 
V 





\ V ) 

\ y / 


= 0 


(24) 

where we have used convexity of 

x\og 2 X for X > 0. 

□ 


Proposition 2. The classical capacity of hypersphere 
theories is equal to 1 bit. 

Proof. It is immediate to check that the channel in which 
Alice prepares pure states ujr and uj-r, with ||r|| = 1, each 
with probability and Bob carries out measurement 
M = {^()), 5 (-r)}, has capacity 1 bit. Hence the 
capacity of bypersphere theories is at least 1 bit. 

The converse follows from Proposition[I] as in the case 
of hypersphere theories, we have M = R = 1. □ 

Superadditivity of classical capacities. The classical 
capacity xg{^ab) of a GPT with state space ^Iab is 
superadditive if xc{^ab) > Xci^A) + Xci^B)- 

The classical capacities of classical and quantum the¬ 
ory are additive: xc{^ab) = Xg{^a) + Xg{^b)- We 
show below that the hypersphere theories defined in sec¬ 
tion m have superadditive classical capacities. 
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B. Dense coding 

The classical capacity of a GPT is the maximum clas¬ 
sical information that can be transmitted without the 
assistance of previously shared resources. The classical 
capacity can sometimes be enhanced by the use of pre¬ 
viously shared entanglement, as in the following general 
dense coding protocol. 

In a dense coding protocol, Alice and Bob initially 
share a bipartite state (j) G ^ab- Alice chooses a mes¬ 
sage X with probability Px from a finite set and ap¬ 
plies the local transformation T^, G 7 a on her system 
A. After Alice’s operation, the state is transformed 
into (j)x = Tx(j>. Alice sends Bob her system. Af¬ 
ter receiving system A, Bob applies a measurement on 
the composite system AB, defined by the set of effects 
{Ey G £AB\J2y — '^ab}, obtaining the outcome y 
with probability p{y\x) = Ey ■ (px- 

Note that following our notation given by (HD and (ED, 
Ey and (px are real matrices, Tx<p denotes matrix mul¬ 
tiplication, and Ey ■ (px = Av{Eypx)^ where ‘t’ denotes 
transposition. 

Dense coding capacity of a state. The dense coding 
capacity X'dc{4’) of a GPT state p G ^ab is Xb>c{P) = 
max/(X : Y) where the max is taken over all dense cod¬ 
ing protocols in which the initially shared state is p and 
where I{X : T) is the mutual information between x and 

y- 

Dense coding capacity of a GPT. The dense coding 
capacity xdc(^^ab) of a GPT with state space Has is 
the maximum of xdc(</>) over all states p G Has- 

Note that we trivially have the following inequality, 
since a dense coding protocol can be viewed as a commu¬ 
nication protocol in which the states Txp are prepared: 

Proposition 3. The \ classical and | dense coding\ capac¬ 
ities of a GPT with bipartite state space GLab satisfy 
Xdc{^ab) < Xci^AB)- 

The terms ‘dense coding’ and ‘superdense coding’ are 
usually treated as synonyms in quantum theory. We use 
here the terminology ‘superdense coding’ as follows. 

Superdense coding. A superdense coding (SDC) pro¬ 
tocol is a dense coding protocol implemented with a state 
p G Has whose capacity satisfies xociP) > Xc(^a)- 

We prove now that superdense coding requires some 
type of entangled states, hence, it is impossible in classi¬ 
cal probabilistic theories. 

Proposition 4. Superdense coding is impossible in a 
GPT with joint state space TIab = ^A 

Proof. Suppose that Hab = ^A Gimin ^b and consider 
the dense coding protocols described above. The initial 
state p G r^AB shared by Alice and Bob is separable 
p = G) w', where coz G Ha, G Hb and {qz}z 


is a probability distribution. After Alice’s operation Tx, 
the state is transformed into px = ^^qzUiz,x®‘-^'z^ where 
= TxLOz G Ha. Bob’s measurement is M = {Ey G 
^Asly The probability of outcome y given input x is 
p{y\x) = Ey ■ qz(^z,x uj{). Gonvexity of the mutual 
information implies that I{X : Y) < 
where Iz{X : T) is the mutual information between x 
and y when the initial state is uJz,x ® w'. Hence we can 
take the initial state to be a product state p = uj ® uj' , 
and drop the label z. We can write p{y\x) = Ey ■ {ujx ® 
uj') = LUx ■ e'y where uix = TxCo and e'y = EyUj'. The set 
{e'yjy constitutes a measurement on Ha (i.e. e'y ■ uj > 
0 Vcli G Ha and e'y = ua, which are implied by the 
fact that Q < Ey ■ p ior all p G Has and Ey = uab)- 
Therefore p{y\x) = e'y ■ Wx is & probability distribution 
that can be obtained by a measurement on Ha. Therefore 
Xdc(Hab) < Xc(Ha), hence, there cannot be superdense 
coding. □ 

Intuitively, one also expects that superdense coding 
requires entangled effects, that is, that superdense coding 
is impossible if £ab = £A®min£B- It is easy to show that 
superdense coding is impossible if Bob’s measurement is 
a convex combination of product measurements: 

Proposition 5. Superdense coding is impossible if Bob’s 
decoding measurement consists of effects of the form 
^yiy2 = Sz Qz^yi ® fy2 with {qz}z 0 , probability dis¬ 
tribution, and {ey{^ G £A\Yyi^v^i = ''-'a} and {fyt^ G 

^B\Yy 2 fy^'^ ~ '“■b} being measurements on systems A 
and B, respectively. 

Proof. Let Bob’s measurement correspond to effects 
^yi,y2 — ® /:y2^ ’ 

Z 

with G £A\Yy^e^yi = UA}y^ and G 

^B\Yy 2 fv^^ ~ being measurements on systems 

A anci B, respectively. We show that in this case 
there cannot be [superdense coding! that is, we show that 
I{X : Yi,Y 2 ) < xc(Ha), where I{X : Y) is the mutual 
information between x and y. 

Gonvexity of the mutual information implies that I (A : 
Y) < YzIzi^ziX : Y) where Iz{X : Y) is the mu¬ 
tual information between x and y for the measurement 
^yily 2 = ® fm- Hence, we can drop the index z, 

and consider a product measurement Ey,^ ^y^ = Cy.^ ® fy .2 ■ 
The probability of outcome yi,y 2 on state px = Txp is 
p{yi,y 2 \x) = (g) fy. 2 ) ' (l>x- The no-signalling prin¬ 

ciple implies that p{y 2 \x) = p{y 2 ), hence, p{yi,y 2 \x) = 
p{y 2 )p{yi\y 2 ,x). Therefore, I[X : 17 ,^ 2 ) = I{X : 
Y 2 ) + I{X : YilFa) = /(A : YilAa), since T 2 is inde¬ 
pendent of A. 

We have that p{yi\y 2 ,x) = p{y 2 \x)~'^p{yi,y 2 \x) = Cy^ ■ 
ujx,y 2 , where uJx,y 2 = ((ua ® fy^) ' 4’x) Pxfy 2 - This is 
because p{y 2 \x) = YyiPiyi^V^lx) = {ua ® fy^) ■ (f>x and 
p{yi,y 2 \x) = {eyj®fy 2 )-px = Cyi ■{Pxfy 2 )- We also have 
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that iOx,y 2 G because i) for any effect e € £a holds 
that e • ujx,y 2 > 0, due to the fact that (e ® fy^) ■ i’x > 0, 
and ii) ua ■ <-^x,y 2 = 1, as follows from the definition of 
ujx,y 2 - Thus, we have I{X : yi|y 2 ) < Xc{^a)- □ 

We note that if £ab = £a ®min £b, the most general 
measurements do not consist only of convex combinations 
of product measurements as above, but also include mea¬ 
surements in which each effect is a convex combination 
of product effects: {Ey = z ®= 

uab}, with € £a, fy""^ & £b (but {e^i^} and {/yj'} 
do not necessarily need to define independent measure- 
ments, i.e., we do not need to have = ua and 

J2y2 fy 2 ^ ~ ub)- This set of measurements include in 
particular those in which the choice of measurement im¬ 
plemented on one of the systems depends on the outcome 
obtained on the other system (what is sometimes named 
as “wirings”). For example, by setting y = (?/i,y 2 ) and 
^(!/i,y 2 ) — I ^ = 0 otherwise, we ob¬ 
tain effects of the form Ey^^y^ = 0 fyT'^- Another 

example of a measurement in this more general set that 
cannot be viewed as a convex sum of product measure¬ 
ments is given in quantum theory by the phenomenon of 
quantum nonlocality without entanglement [i^ . We leave 
it as an open question whether superdense coding is pos¬ 
sible with this class of measurements. A step towards 
proving this generalization of Proposition is the fact 
that superdense coding is impossible if £Iab is a state 
space in generalized nonsignalling theory [2^ , also called 
boxworld [4l|, where all nonsignalling states are allowed. 

C. Hyperdense coding 

Quantum theory allows superdense coding, since e.g. 
for the maximally entangled state (/) of two qubits, 
Xdc(<^) = where £Ia is the state space of sin¬ 

gle qubits However, superdense coding is limited in 
quantum theory as Xdc(<)’) < “^Xci^A) for any Qa and 
(f) € £Iab ■ We call hyperdense a dense coding protocol 
overcoming this quantum limitation. 

Hyperdense coding. A hyperdense coding (HDC) pro¬ 
tocol is a dense coding protocol implemented with a state 
(j) G £Iab whose capacity satisfies xdc(<^) > “^Xci^A)- 

Note that if xc(f^A) = Xc{^b), for example if = 
Hb, then hyperdense coding implies superadditive clas¬ 
sical capacities: xc{^ab) > Xbc{^ab) > 2xc{^a) = 
Xc(^^a) + Xc(f^B)) as follows from Proposition [3] and the 
given definitions. 

Proposition 6. Hyperdense coding is impossible in 
quantum theory. 

Proof. We give two simple arguments within quantum 
theory. First, the convexity of the mutual informa¬ 
tion [42l| and the convex decomposition of mixed states 
into pure states imply that the dense coding capacity 


Xdc(^^ab) is achieved by pure states. Furthermore, in 
the case that the state (p G ^Iab initially shared by 
Alice and Bob is pure and Alice’s quantum system A 
has Hilbert space dimension d, the Schmidt decomposi¬ 
tion of pure states [l| implies that the dimension of the 
Hilbert space 4> that is accessible by Alice’s transforma¬ 
tions on A is no greater than d^. Thus, we have that 
Xdc(^^ab) < 21og2 d = 2xc{^a)- 

For the second alternative argument, consider clas¬ 
sical systems C and D, initially uncorrelated to Al¬ 
ice’s and Bob’s joint quantum system AH, that record 
Alice’s preparation x and Bob’s measurement outcome 
y, respectively. The principle of quantum information 
causality Q states that the quantum mutual informa¬ 
tion between system C and the joint system ABD, af¬ 
ter Alice’s transmission of a qudit A satisfies the bound 
Iq{C : ABD) < 21og2d, independently of how big the 
Hilbert space dimension of Bob’s system B might be. 
It is easy to see from the data-processing inequality [I| 
that I{X : Y) < Iq{C : ABD), from which follows that 
XDc(f^AB) < 21og2d = 2xc(f^A)- □ 


D. Hyperdense coding in hypersphere theories 

We now introduce a dense coding protocol in hyper¬ 
sphere theories of dimension n = 2^ — 1 which is hy¬ 
perdense for > 2. Alice and Bob initially share the 
entangled state po = I € Sn C £Iab, which is the iden¬ 
tity matrix in 0 , as defined by ©• We consider 

messages x G {0,1}-^. To encode x, Alice applies the 
local transformation Tx G Tn given by (I12L which en¬ 
codes the message x G {0,1}^. From (IT3)) and m, 
the joint state transforms into Tx4>o = (px & Sn. Alice 
sends the system A to Bob. Bob applies the measurement 
M = {Ey} yg{o,i}N given by (O on the joint system AB. 
The probability that Bob obtains outcome y when Alice 
encodes the message x is p{y\x) = Ey ■ px = ^y,x- Thus, 
Bob decodes Alice’s N bit message perfectly. This shows 
that 

Xdc(^^ab) > N . (26) 

Since the classical capacity of individual systems in HSTs 
is 1 bit, this provides an example of hyperdense coding for 
N > 2. By taking N ^ oo, Bob can learn an arbitrarily 
large amount of information by receiving a system that 
locally can only encode up to one bit. 

The above dense coding protocol can be turned into a 
classical communication protocol in which Alice encodes 
message x in the state px of two HST systems, and Bob 
decodes the message perfectly using measurement M = 
{Ey}y^{Q i}N. Hence (see Proposition [3]) 

‘2N > xd^AB) > XDc(f^AB) > N . (27) 

Even though each system has capacity of 1 bit, together 
they have capacity of at least N bits, with arbitrary 


TV G N. Thus, in the theories defined above, the clas¬ 
sical capacities are superadditive. 

The upper bound of 2N (the left hand side of (E71) i 
follows from the fact that the classical capacity of any 
GPT is bounded by the log 2 of the dimension of the state 
space [ 2 ^ , which in the case of entangled states for a pair 
of HST systems, as defined above, is 2N. Thus, the gap 
we exhibit between single system classical capacity and 
two system classical capacity is close to optimal. 


E. Teleportation and entanglement swapping in 
hypersphere theories 

Quantum teleportation and entanglement swap¬ 
ping [ 4 ^ are fundamental protocols of quantum informa¬ 
tion theory. We show how they can also be realized in 
the context of hypersphere theories considered here. We 
notice that teleportation and entanglement swapping in 
GPTs have been studied before [23,[^lll|. In particular, 
it has been shown that boxworld, the theory admitting all 
nonsignalling correlations, does not have teleportation or 
entanglement swapping [ 2 O, |4l| . General conditions on 
GPTs to support teleportation are given in (2^. 

We consider the following protocol. Alice has a system 
A' in a pure state Wa G ^a' that she wants to teleport to 
Bob’s location. To do so, Alice and Bob initially share 
an entangled state (f>o £ Sn C ^ab given by © in the 
systems A and B, at Alice’s and Bob’s locations, respec¬ 
tively. Alice applies the measurement defined by the en¬ 
tangled effects Ex G En C Sab given by (|9]) on her joint 
system A'A and obtains the outcome x with probability 
Px = 2“^, for X G {0,1}^. Alice sends Bob her outcome. 
Bob applies the correction operation Tx G Tn given by 
(na on his system B and obtains, as we show below, the 
teleported state Wa on B with unit probability. From the 
linearity of the theory, this protocol works too if Alice’s 
input state Wa is mixed. In particular, the system A' can 
be in an entangled state p G SLa'c with another system 
C, leading to entanglement swapping. Furthermore, we 
show that for theories with bipartite state spaces SIab de¬ 
scribed above, our protocols for teleportation and entan¬ 
glement swapping are optimal in the amount of classical 
information sent by Alice, that is, teleportation or en¬ 
tanglement swapping cannot be achieved perfectly with 
a classical channel of capacity smaller than N bits. 

Let us now show that these protocols for telepor¬ 
tation and entanglement swapping work as claimed. 
To do so, we first define a consistent state space for 
a tripartite system: SIa'AB = convex hull {Ha' ®min 
SIab,^a G)min StA'B,^B ^min SIa'a}, where VtAB = 
convex hull {IIa ®min SIb,Sn} and Sn = , 

as before. We define the four party state space in 
a similar way, SIa'Abc = convex hull {SIa'AB Gimin 
Sic, Si A'AC ^min SIb, Si ABC ^min SI A', SI A'BC ^min IIa}- 
We notice that the three and four party state spaces 
are symmetric under any permutation of the systems, 
all product states are included, and the set of bipartite 


entangled states iSjv is included in any bipartition. It fol¬ 
lows that the set of global effects Sa'abc includes all the 
product effects E<^E', where E' £ Sbc and E £ Sa'A, in 
particular for the bipartite entangled effects E,E' £ En, 
for any permutation of the systems A', A, B and C. 

The state Wo is successfully teleported to system B if, 
for any measurement on B, the outcome probabilities are 
those predicted by uja- In our notation, this means that 

Ptel{y\x) = — {{Ex)A'A ® (eyjs) • ((WojA' <8 {(j)T^)A b) 
Px 

— €y * CJa^ 

for any effect Cy £ Eb, where the label ‘tel’ denotes that 
these are Bob’s outcome probabilities after performing 
the teleportation protocol given above. 

We show that the probability that Alice and Bob ob¬ 
tain respective outcomes corresponding to the effects 
Ex £ Sa'a and Cy G Sb is 

Ptei(x,y) = ((Ex)a'A <8> (ey)B) ■ ((Wa)A' G) {</>oT*)ab) 

= 2-^eyUJa. (29) 

Since the probability that Alice obtains outcome x is 
Px = J2yPtei{x,y) and since Y)y^y = E we have 
from (1^ that Px = 2~^. Thus, we have ptei{y\x) = 
^Ptei(a;,2/) = Cy • Wa, as given by (EH). 

We show (Ei). Since the states (px and the local trans¬ 
formations Tx are diagonal matrices, as given by © and 
(fT^ . we have from (|T^ and (fT^ that (poTp, = px- Us¬ 
ing the indices i, j and k for the systems A', A and B, 
respectively, we obtain from (12^ that 

Pte\{x^y) = ^ ^ {Ex)i^j{Gy)k{^a)i{Px)j,k 

— 2 ^ ^ (0a; (Cy )fc (^a)i 

i,j,kG{0,l}^ 

— 2 ^ ^ {4^x')k,k{^y'}k{^a)k{4^x'}k,k 

kG{0,l}^ 

= 2-^eyUJa, (30) 

where in the second line we used the definition for the 
effects Ex given by ®, in the third line we used that the 
states px are diagonal matrices, as given by ©, and in 
the last line we used that the entries of px are I or — 1, 
as given by © and ©. 

The introduced teleportation protocol can easily be ex¬ 
tended to entanglement swapping. Let Alice’s system A' 
be initially in an entangled state p £ Slyc with a sys¬ 
tem C, held by Gharlie. As in the teleportation protocol, 
Alice and Bob share the entangled state po £ SIab in 
systems A and B. Then, Alice and Bob perform the 
teleportation protocol described on the tripartite system 
A'AB. 

The entangled state p is successfully swapped from 
systems A'C to systems BC if the outcome probabilities 
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of any measurement on the joint system BC are those 
predicted by the state (p. The probability that Alice ob¬ 
tains the outcome G Sa'A on the system A'A and 
Bob obtains (by collaborating with Charlie) the outcome 
E'y G £bc on the system BC, after he applies the correc¬ 
tion Tx on B, is 

PswupiXyU) = {{Ex)A'A®{Ey)Bc)-{{.P)A’C®{.4’QTl)AB)- 

(31) 

Entanglement swapping succeeds if Psw&piy\x) = 
^Pswapix, y) = E'y ■ ip for all p G IIa'C and E'y G £bc, 
where Px = J2yPswap{x,y) > 0. This follows straightfor¬ 
wardly as in (HTnil . with px = . 

Finally, we show that for theories with bipartite state 
spaces £Iab, teleportation or entanglement swapping 
cannot be perfectly implemented with a classical channel 
of capacity of less than N bits. We show this property 
for entanglement swapping. Since the linearity of the 
theory allows us to extend a protocol for teleportation to 
a protocol for entanglement swapping, as we did above, 
this property holds for teleportation too. We follow the 
argument given in [i^. Assume that Alice and Bob im¬ 
plement the dense coding protocol of section HV PI with 
an initial state (jjo in systems A and B at Alice’s and 
Bob’s locations, respectively. The state is transformed 
into (px after Alice’s operation, encoding the N bit mes¬ 
sage X. Suppose now that Alice does not send Bob the 
system A, but that Alice and Bob apply some entan¬ 
glement swapping protocol, using some resource state in 
systems A', held by Alice, and B', held by Bob, so that 
at the end of the protocol Bob has the state px in his 
systems B'B. Then, Bob completes the dense coding 
protocol and learns the message x. Assume that such 
a protocol for entanglement swapping requires Alice to 
encode Bob’s correction operation in a message y via a 
classical channel with capacity oi M < N bits. Now 
suppose that Alice does not actually send y to Bob, but 
that Bob instead tries to guess the value of y to com¬ 
plete the protocol without any communication. Then, 
Bob will guess y correctly with probability at most 2“^, 
in which case he completes entanglement swapping suc¬ 
cessfully and then learns x. Thus, Bob obtains Alice’s 
N bit message x with probability > 2~^ without 
any communication from Alice to Bob. This means that, 
from the set of 2^ messages of N bits that Alice can en¬ 
code, there exists at least a pair of messages that Bob can 
distinguish with a nonzero probability without receiving 
any physical system from Alice, which violates the no¬ 
signalling principle. It follows that a classical channel of 
capacity N bits is necessary to complete entanglement 
swapping or teleportation. 


V. DISCUSSION 

Here we have introduced hyperdense coding: super- 
dense coding in which more than two bits of information 
can be communicated by transmission of a system that 


locally encodes at most one bit. We have presented dense 
coding protocols in the context of hypersphere theories, 
in which single systems are described by an Euclidean 
ball of dimension n. Our protocols are hyperdense when 
n > 3. 

It is well known that if one imposes a sufficient set of 
axioms to GPTs then one recovers classical or quantum 
theory Therefore, the theories we have 

introduced must violate at least one of these axioms. The 
theories we introduced violate continuous reversibility. 
Continuous reversibility is an important physical condi¬ 
tion that has been imposed to the framework of GPTs 
in several derivations of finite dimensional quantum the¬ 
ory Continuous reversibility states that 

for every pair of pure states there exists a continuous 
reversible transformation that transforms one into the 
other [T^ . 

The theories that we introduced not only violate con¬ 
tinuous reversibility, but already simply reversibility. In¬ 
deed, the set of transformations acting on a single system 
is discrete, see Eq. m, while the set of local pure states 
is continuous, given by a hypersphere. Thus, not every 
pair of pure states describing a single system is connected 
by a transformation. 

Note further that if we modify the group of transfor¬ 
mations T for the theories given above to be T = SO(n) 
so that continuous reversibility is satisfied (at least for 
local systems), then the space of entangled states must 
be modified and the hyperdense coding protocols we in¬ 
troduced no longer work. In fact, we obtain in this case 
that less than one bit is communicated if n ^ 3, while 2 
bits are communicated if n = 3, which corresponds to the 
Bloch ball (see details in Appendix lAT) . More generally, 
we have shown, under a variety of technical conditions, 
that for arbitrary hypersphere theories, hyperdense cod¬ 
ing is impossible if one imposes continuous reversibility 
on local systems (see Appendix iBll. 

In general, continuous reversibility not only applies to 
single systems with state spaces ft a or Qb, but also to 
composite systems in £Iab, i-e., for every pair of pure 
states in Qab there should exist a continuous reversible 
transformation that transforms one into the other. In 
line with the above remarks, note that imposing con¬ 
tinuous reversibility for composite systems implies that 
superdense coding is possible in HSTs only if n = 3, 
which corres pon ds to the Bloch ball. This follows from 
the result of [l^ showing that for a pair of HST systems 
satisfying continuous reversibility, entangled states only 
exist for the case n = 3. While our theories must thus ob¬ 
viously violate continuous reversibility, note that it is an 
open question whether they satisfy the weaker require¬ 
ment that there merely exists a transformation mapping 
one of the product state to one of the entangled states 
used in our protocol. 

In addition to continuous reversibility, tomographic lo¬ 
cality is another physical conditions that has been used 
to derive finite dimensional quantum theory in the frame¬ 
work of GPTs [H, [H, [H, [l^. Tomographic locality is 
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a physical property stating that the description of en¬ 
tangled states is completely fixed by the outcome prob¬ 


abilities of local measurements [1^ 120| . Violation of to¬ 
mographic locality means that there are global degrees of 
freedom that are inaccessible to local observers. The the¬ 
ories that we have discussed satisfy tomographic locality. 
It is not difficult to build theories that exhibit hyperdense 
coding and that satisfy local continuous reversibility, but 
that violate tomographic locality (see Appendix [C]) . 


We remark that while hyperdense coding must nec¬ 
essarily entail a violation of one of the physical axioms 
defining quantum theory, such as tomographic locality 
or continuous reversibility, it is not necessarily the case 
that any theory violating one of these axioms always al¬ 
lows for the existence of hyperdense coding, even if it 
includes entangled states. Intuitively, one can construct 
theories in which the entangled states are very weekly 
entangled and thus do not achieve hyperdense coding. 
We give examples of this type in Appendix [D1 where we 
construct theories whose local systems are described by 
HSTs and whose bipartite state spaces include a set of 
nontrivial entangled states. These theories satisfy tomo¬ 
graphic locality and violate continuous reversibility, for 
local and global systems. We show that some of these 
theories do not have hyperdense coding, while others do 
not even have superdense coding. 

On a more general level, our hyperdense coding proto¬ 
cols have highly unphysical consequences. Indeed, they 
imply superadditive capacities: the classical capacity of 
the bipartite system AB is greater than the sum of the 
local capacities of A and B. A breakdown of additiv¬ 
ity suggests that in such theories one cannot define a 
unit of information, and hence that the whole framework 
of information theory breaks down. (We note that in 
quantum theory there is a weak breakdown of additivity, 
namely the capacity of a specific channel can be super¬ 
additive Q. This has much less dramatic consequences 
than the superadditivity discussed here). 

Another possible consequence is related to thermody¬ 
namics. Indeed, the entropy of a state uj can be defined, 
as briefly suggested in (2^ , as the maximum classical ca¬ 
pacity of any encoding/decoding protocol in which Alice 
sends Bob the state uj = '^^PxOJx on average. This def¬ 
inition coincides with the Shannon and von Neumann 
entropies in the classical and quantum cases. It dif¬ 
fers however from other definitions of entropy in GPTs 
[H, . Our hyperdense coding protocols show that us¬ 

ing the definition based on classical capacity, entropy can 
be superadditive in GPTs. This suggests that statistical 
mechanics could not be applied to these theories, and 
that they would not have a macroscopic limit. 

For these reasons, it would be interesting to investi¬ 
gate whether additivity of the classical capacity should be 
taken as a basic physical condition for any reasonable the¬ 
ory. We have suggested that it is related to other prop¬ 
erties, including tomographic locality and continuous re¬ 
versibility. What the detailed relation is, and whether 
additivity of classical capacities can replace other phys¬ 


ical conditions previously explored to distinguish quan¬ 
tum theory, is an open question. 
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Appendix A: Locally continuous hypersphere 
theories 


We define precisely the notion of continuous reversibil¬ 
ity for local systems. 

Local Continuous Reversibility (LCR). For a bi¬ 
partite system AB, any pair of pure states for the local 
system A, or B, is connected by a continuous reversible 
transformation. 


We investigate the 


local continuous reversibility 


implications 
the 


on 


of imposing 
dense coding 
protocol given in the main text. In order to ensure the 
consistency condition, we modify the space of states and 
effects in a minimal way, in terms of two parameters A 
and T. Recall that n = 2^ — 1 is the dimension of the 
local HST systems, with A € N. We show that if we 
impose satisfaction of local continuous reversibility then 
our dense coding protocols communicate two bits only 
for A = 2 and no more than one bit for A ^ 2, hence, 
they are superdense only for A = 2, which corresponds 
to the Bloch sphere. 

We consider entangled states and effects similar to the 
ones in the main text, but modified in terms of two pa¬ 
rameters A and r: 



where T)j is defined as in Eq. m of the main text, 
for p G {0, l}'^ and some A, r G [—1,1] that we specify 
below. Notice that the case A = r = 1 corresponds to the 
states and effects given by Eqs. © and ([5]) of the main 
text. Equations ®, ©, (HU) and m of the main text, 
and Eq. dHI) imply that = diag(l,0), 

which is the unit effect uab, for any value of t. 

We impose that local continuous reversibility must be 
satisfied. We must have that Ta = Tb = T, where T is 
defined as the set of transformations of the form 


T = 



(A2) 
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with T G T and 'T being a group of continuous reversible 
transformation that is transitive on the sphere in 
[l^ . It follows that T is a subgroup of 80(2-^ — 1) [^ . 
Since 80(1) = {!}, there are no continuous reversible 
transformations for the case = 1. In fact, the case 
= 1 corresponds to the classical bit, whose transforma¬ 
tions are discrete and correspond toT = 0(1) = {1,-1}. 
Then, there is no superdense coding for A^ = 1 since su- 
perdense coding does not exist in classical probabilistic 
theory. Thus, in the following, we only consider N > 2. 
For N ^ S, the only possible group of continuous re¬ 
versible transformations that is transitive on the sphere 
is T = 80(2^ — 1), while for N = 3, T can be either 
80(7) or G 2 We consider here that T = 80(2^ — 1) 
for all N e N/{1}. 

For a given A, in order to have that the allowed lo¬ 
cal transformations satisfy the consistency condition, it 
must be that S flyis, for all T,T' G T and 

/i G {0,1}^. We thus dehne flAB = convex hull{nA8min 
where A(^) = T' G T} is a con¬ 

tinuous set of entangled states, differently to the discrete 
set Sn defined by the states o of the main text. We 
notice that G A^'^^ for all fj. G {0,1}^. Since |A| < 1, 
following Eq. m of the main text, it is straightforward 
to show that flAB ^ flA <S>ma,x ^B ■ 

We show that C Sab requires 

- (2^ - 1)“^ < Ar < ( 2 ^-3)"\ (A3) 

Consider the states and where T' = 

diag(l, 1, —1, —1..., —1) G <8) R^^. We see that T' 
is of the form ()A2ll with T' G 80(2^ ~ !)■ We have that 

4")-<^(^)=2-^(l+(2^-l)Ar), 

= 2-^ (1 - (2^ - 3) Ar) . (A4) 

Since G SIab, tbe condition Eq '^ G Sab re¬ 
quires 0 < E^^^ ■ < 1 and 0 < E^^^ ■ < 1, which 

from (EH) implies (EH). 

From Eqs. ®, ©, uni) and (fTH) of the main text, and 
Eq. EH), Bob’s outcome probabilities are 

p{y\x) = E^J^ ■ 4^) 

= 2~^(rXdy • da; -I- 1 — Ar) 

= 2~^ {2^ TXSy^x + 1 — Ar) 

= XrSy^^ + 2-^(1 - Ar). (A5) 

Satisfaction of (IA3I) implies that 0 < p{y\x) < 1. The 
probability that Bob learns the correct message is 

p{y = x\x) = P\.r,N, 

< Qn, (A6) 

where 

Py,,^ = 2-'^(l + (2'^-l)Ar), (A7) 

QAr = 2-^+i(2^-2)(2^-3)-^ (A8) 


If y a;, we have p(y|a;) = {2^ — I)“^(l — Pa.t.at)- Given 
that Px = 2.~^ for x G {0,1}'^, it is straightforward to 
obtain that 


I{X:Y)=N- H{Px,r,N), (A9) 

where H{p) = h{p) -|- (1 — p) log 2 (2^ — I) is the Shan¬ 
non entropy of a random variable Z taking values 
Z = Q with probability p and Z = j with probabil¬ 
ity (2^ - l)“Hl - P), for j = 1 , 2 ,..., 2^ - 1 , and 
h{p) = —plog 2 p—(1—p) log 2 (l—p) is the binary entropy. 
The maximum value of I{X : E) is achieved by minimiz¬ 
ing H{P\^r,N), which is obtained for P\.t,n = Qn- We 
thus consider Ar = (2^ — 3)“^, in which case we have 
P\,t,n = Qn- Thus, from (|A9D . the maximum value of 
I{X : Y) for the considered protocols is 

I{X ■.Y)=N-H{Qn)- (AIO) 

The achieved values of I{X : Y) for N = 2,3,4, 5 are 
2,0.15,0.05,0.02, respectively. In general, we see from 
(IA8I1 and (lAlOll that I{X : Y) decreases with N. Thus, 
our protocols achieve superdense coding only for A^ = 2. 


Appendix B: Communication capacities of locally 
continuous hypersphere theories 


We prove the following results stating the impossibility 
of superadditive capacities, hyperdense and superdense 
coding for particular classes of bipartite HST systems. 

Proposition 7. Let SIab be a bipartite state space sat¬ 
isfying the no-signalling principle, tomographic locality 
and \local continuous reversibility with SI a ^ SIb — SI 
a hypersphere system of dimension n > 2. Consider 
states (fx S SIab that encode a message x with proba¬ 
bility Px, whose local states are Wa,. = {a,,) G and 
= (h4 S ^B, with Oxjbx G R" and ||aa;|l < 1, 
ll^aill < 1- Let y be the outcome of a measurement on 
the state (px corresponding to the effect Ey G Sab and 
let I{X : Y) be the mutual information between x and y. 
Let T be the set of allowed local transformations on SIa 
and SIb- In general, their elements are of the form 


T = 



(Bl) 


where the matrices T form the group T. The following 
properties hold: 

i) for states with Ox = 0 or bx — 0, we have L(X : 
Y) < 2 for odd n 7, with equality achieved only for 
n = 3; 

ii) if T = SO{n) and n is odd then L{X Y) < 2 for 
Gx = 0 or bx = 0, with equality achieved only for n = 3; 

Hi) ifT= SO{n) and n is even then L{X : Y) < 2 for 
arbitrary (px G XI ab, and I{X : E) < 1 for states with 
Gx = 0 or bx = 0 ; 

iv) if T = 0{n), which includes continuous and dis¬ 
continuous transformations, we have for any n that. 
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I{X '■ Y) < 2 for arbitrary cpx G ^ab ind I{X : y) < 1 
for states with = 0 or = 0 ; 

v) for states that are obtained by local transformations 
(t>x = Txip, we have I{X :Y)<2ifT = SO{n) with even 
n, or if 7' = 0(n) with arbitrary n. 

Recalling that the classical capacity of a single sys¬ 
tem in hypersphere theories is 1 bit, no protocol using 
the above families of states can achieve superadditivity 
of classical capacities, hyperdense coding or superdense 
coding, for the respective cases. 

The following lemmas are used in the proof of Proposi¬ 
tion [71 given above. They provide important constraints 
on the bipartite states and effects in HSTs for arbitrary 
state spaces limited only by NS, TL and the restric¬ 
tion that ri/i ~ VIb cs n is a HST system of dimension 
n. 


Lemma 1. Let VLab be a bipartite state space that sat¬ 
isfies no-signalling and tomographic locality, with VLa — 
LIb — Lt being the state space of a HST system of di¬ 
mension n. Any state (j) S ^ab can be expressed as a 
matrix 


(j) = 



(B2) 


where a,b G M", C € M” 0 K", ||a|| < 1 , || 6 || < 1 and 
||cfc|| < I 7 with Ck being the kth column vector of C, for 
k = 1 , 2 ,...,n. 


Proof. The principles of no-signalling and tomographic 
locality imply that cj) G LIab can be expressed as the 
matrix (fR^ . with C e M” 0 M” and a,b G M”, as given 

by Eq. (ED of tllG m&m toxt, Hlici B ^ 

LIab C GIa ^max^B [HI. We show below that for HSTs, 
||aj| < 1 , ll&ll < 1 and ||cfe|| < 1 , for fc = 1 , 2 ,..., n. 

Since the respective local states on GIa and LIb are 
tv A = 4>u = (^) and ujb = fAu = (^), where t de¬ 
notes transposition, it follows from the definition of H 
that ||a|| < 1 , ll&ll < 1 . 

We show that ||cfe|| < 1, for fc = 1, 2,..., n. Let Vk G 
R" be a column vector whose fcth entry is equal to unity 
and all other entries are zero. It follows that Cvk = Ck- 
Let ||c/c|| > 0. Consider the following extremal product 
effects: ® and O 

\ ( -ik ) 5 where Ck = . Using expression (IB2I) and 

the property Cvk = Cfc, we obtain 


= i (1 - llcfcll - • 4 + (-1)4,), (B3) 


where 6 , is the kth entry of b and A e {0,1}. Since 
(f G it A Cimax Hb and are product effects, we 

have E^^'^ • ^ > 0, for A G {0,1}. It follows from (IB3I) 
that -I- E^'^) . (j) = \ — lldl > 0, which implies 

llcfell <1. □ 


Lemma 2 . Let Sab be the space for normalised effects 
corresponding to a bipartite state space LIab that satisfies 


no-signalling and tomographic locality, with it a — it b — 
U being a HST system of dimension n. Any effect E G 
Sab can be expressed as a matrix 


E = "f 


1 /3* 

a r 


(B4) 


where 0 < 7 < 1, a,/3 G M", T G M” ® R”, ||a|| < 1, 
||/ 3 || < I7 llbfcll < 1, with 7, being the kth column vector 
ofT, for k= 1,2,...,n. 


Proof. The principles of no-signalling and tomographic 
locality imply that any state (j) G Hab can be expressed 
as the matrix (|B2I) . Thus, any effect E G Sab can be 
expressed as the matrix given by Eq. m of the main 
text: 


E = 



(B5) 


where 7 G R, a, /3 G R”, T G R" 0 R". We show below 


the following inequalities: 

0 < 7 < 1, (B 6 ) 

||a|| < min{ 7 ,1- 7 }, (B7) 

II/3II < min{ 7 , 1 - 7 }, (B 8 ) 

|| 7 ,|| < min{ 7 , 1 - 7 }, (B9) 


for k = 1,2,... ,n. From (USD - (USD, we see that the 
case 7 = 0 corresponds to the zero effect, which has an 
expression in agreement with (IB4I) . For the case 7 > 0, 
it follows from (|B5I) ~ (IB9p that E can be expressed as 
the matrix 


where a' = ft' = 7 “^/? and T' G R” 0 R" has col¬ 

umn vectors 7 ^ = 7"4fc, with lla'II < 1 , WP'W < 1 , || 7 (,|| < 
1, for k = 1,2,..., n. Thus, by dropping the primes in 
(jBToD, we obtain the claimed result (|B4|. 

We complete the proof by showing (IB6I1 - (IB9I) . To do 
so, we use the expression of E given by (IBSD . We apply 
E on particular product states (j) and use the conditions 
0 < E ■ (j) < 1. This follows because it a 0 min C Qab- 

We show (|B 6 D. We apply the effect E on the state 
(fo = (g) 0 (g). We obtain E ■ (fo = 7 . The condition 
0 < E ■ (fo <1 implies (IB 6 I) . 

The inequalities (lB7l) - dBm are trivially satisfied if 
l|o:|| = ||/3|| = llbfell = 0- Thus, below we consider the 
case |la|| > 0 , ||,5|| > 0 , || 7 fc|| > 0 , for fc = 1 , 2 ,..., n. 

We show jED. Consider the states (j)± = () 0 ( 0 ) j 
where d = HallWe have that E-(/)± = 7 ± ||a||. The 
conditions 0 < E ■ (j)± < 1 imply (IBTD . 

We show (IB8I1 . Consider the states = ( 0 ) ® ), 

where /3 = ||,5||“^/3. We have that E = 7 ± ||,5||. The 

conditions 0 < E • < 1 imply (|B8I1 . 

We show Hi). Let Vk G R" be a column vector with 
the kth entry equal to unity and the other entries equal 
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to zero. We have that rf;;,; = Ik- Let ) 

and where % = || 7 fc||“^ 7 fc and /r, i/ G 

{0,1}. Consider the states 4'^^'’^'^ = for & 

{0,1} and k = 1, 2,..., n. We obtain 


where bk and Pk are the fcth entries of b and /3, and Ck and 
7 fe are the fcth column vectors of C and L, respectively. 
Thus, in the case a = 0, we have from (IB16I1 that 

X = /l-6 + r-C<l + 2(/3fc6fc + 7fc-Cfc), (B17) 


E ■ = (-l)'^+ni7fe|| + (-l)^a . ^ + i-irp, + 7, 

(Bll) 

where Pk is the A:th component of p. From the conditions 
0 < i? ■ < 1, we obtain 1 > + 4^k'^'^) = 

|| 7 fc|| +7 and 0 < \E ■ {4^°'^^ + 4^^'°'’) = -||7fe|| +7, and 
hence (IB9I) . which completes the proof. □ 


Proof of Proposition Since ^Iab satisfies the no¬ 
signalling principle and tomographic locality, Lemmas [T] 
and [2] hold. Thus, the states 4x G ^ab and the effects 
Ey G Sab can be expressed as in (lB2l) and (lB4l) : 


4x 


VOx Cx) ’ 




It follows from (IB12I) and Proposition [T] that 


(B12) 


for fc = 1, 2 ..., n. Since P ■ b = X]fe=i Pkbk and T • C = 
Sfe=i 7fc ■ Cfc, it follows from (IB17I) that 

X < < 3, (B18) 

n 

where the second inequality is achieved only for n = 3. 
Thus, from (Eisi) and (IM . we have I{X : T) < 2, 
with equality achieved only for n = 3. The case 6 = 0 is 
proved similarly by considering T' = I and T = Qk- 
We show ii). Since we assume that T = SO(n) for 
arbitrary odd n, the proof follows straightforwardly as 
for i). 

We show hi). We assume n even and T = SO(n). 
Consider equation (IB15I) for the case T = I and T' = —I. 
We have T, T' G SO(n). It follows that 


/(X:r) <log 2 (I + x), (BI3) 

where x = maxjxx.y}, Xx,v = ay ■ + Py ■ b^ ETy ■ 

and the maximum is taken over all states 4x G SIab and 
measurements with effects Ey G Sab ■ Let this maximum 
be achieved by a state 4 and an effect E, for which we 
drop the x and y labels. From (IB12I) . we have 

f;-0 = 7(i + x), (BI4) 


l-ba-a-/3-6-F-C'>0. (BI9) 

Thus, we have 

X < 1 -b 2a • a. (B20) 

If instead we have T = —I and T' = /, we obtain the 
similar expression 


where x = <a'a-|-/3'6-l-F-C. 

The set T of allowed local transformations on HST 
systems Si a — SI b ~ II of dimension n > 2 that satisfy 
local continuous reversibility has elements of the form 
given by Eq. m, where the group T must be transitive 
on the unit sphere in K." M- There are various groups 
with this property. In general, T is a subgroup of SO(n). 
For odd n ^7, the only possibility is T = SO(n) [s^- 

From the consistency condition, we have T4xT'^ G 
SIab for all 4x G SIab and all T, T' G T, hence, 
Ey ■ {T4^T'^) > 0 for all Ey G Sab- Thus, E-{T4T'^) > 0 
for all T, T' G T- It follows from (|B14I) that 

I -b a • a' -b /3 • 6' -b F • C" > 0, (BI5) 

where a' = fa, b' = f'b and C = fcf'\ for all f , f' G 

r. 

We show i). As said above, for odd n ^ 7, local con¬ 
tinuous reversibility implies that T = SO(n). Consider 
Eq. (IB15I) for the case a = 0. Let f = I and f' = Qk 
being a diagonal matrix with all entries equal to —1 ex¬ 
cept for the fcth entry, which equals unity. We have that 
f,f' G SO(n). It follows that 

l-P-b-T-C + 2{Pkbk+jk-Ck)>0, (BI6) 


X < I + 2^ • 6. (B2I) 

From Lemmas [T] and [21 we have ||a|| < I, ||6|| < I, ||a|| < 
1 and 11/311 < I. It follows from (IM . (IBM ' and (j^ 
that I{X : E) < 2 and I{X : E) < 1 if a = 0 or & = 0. 

We show iv). We assume E = 0(n) and arbitrary n. 
Consider equation (IB15I) for the case f = I and f' = —I. 
We have T, f' G 0(n). The proof follows similarly as for 
hi). 

We show v). We assume T — SO(n) and even n, or 
T = 0(n) and arbitrary n. In any case we have —I G E- 
Considering equation (IBI5I) with f = I and f' = —we 
obtain, as in the proof of iii), the expressions (IB20I) and 
dmH), from which it follows that I{X : E) < 2. However, 
the equality I{X : Y) = 2 cannot be achieved by states 
that are obtained by local transformations, 4x = Tx4oy 
as we show. On the one hand, from (jBTSl) and (IBM , 
a necessary condition for the equality /(A : E) = 2 is 
that ||a|| = I, that is, that the local state tOa G SIa is 
pure. On the other hand, if Wq is pure, it is easy to see 
that 4 must be product: 4 = ^b, for some cob G SIb 

[ 23 . Thus, if ||o|| = 1, the bipartite states are product: 
4x = ®ujb- It follows from Proposition |T] that in this 
case /(A : E) < I < 2. □ 
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Appendix C: Locally continuous hyperdense coding 
violating tomographic locality 


We show that relaxing tomographic locality 
allows hyperdense coding while still satisfying 
local continuous reversibilityl 


fn') 

Consider local state spaces ft a — , where 


= 

i — 




0 e 


,r e 


< 1 


I, for m G N 


and n G Z+. The pure states satisfy ||r|| = 1. The unit 
effect is M = (g) with 0 being the null vector in 

(n) 

The state space flm corresponds to an m—hypersphere 
system embedded in a bigger vector space. The space of 
local effects £ is the convex hull of the zero effect, the 
unit effect u and the extremal effects ^ ° j where 

0 is the null vector in R” and r G K."* with ||r|| = 1. In 
what follows we consider n = 2^ — 1, with N G N/{1}. 
The case iV = 1 corresponds to a classical bit. 

We define the set of allowed local transformations as 
Ta — Tb — T- The elements of T are defined by 


rpiR) 

A* 


0 

0 R 


(Cl) 


where R G SO(m) and G Tn, for /r G {0,1}^, as 
defined by Eq. m of the main text. It is straightforward 

(n) 

to see that the local state space rtm remains invariant 
under the set of allowed local transformations T. 
Consider the following entangled states 

* „ - (tr S). (C2) 

where G iSyr is defined by Eq. o of the main text, for 
/r G {0,1}^. That is, is a state given by a diagonal 
matrix whose first n + 1 = 2^ entries correspond to the 
diagonal matrix and the last m entries are zero. 

We define ^Iab = convex hulljnA Cimin A}, where 
A = {T^oT'^\T,T' G T} and where $o is the state 4)^ 
with /j, = (0,0, ...,0) being the string of N zero en¬ 
tries. From Eqs. m and m of the main text, and 
Eqs. m and (IC2l) . it is straightforward to see that 

A = 

We show that £Iab Q ^A'Simax^B- From the definition 
of £Iab, we only need to show that A C £Ia C)max ^b- 

Consider arbitrary effects e = ^ o ^ € £a and / = ^ o ^ G 

£b- We have that x,C G [0,1] and a, P G R™. Let 
4)^ G A. We have that 

(e ® /) • $^ = xC- (C3) 

Since X)C ^ [0) 1]) we have (e 0 /) • <1>^ G [0,1] for all 
/r G {0,1}^. We also have that (ua 0 ub) ■ = 1. 

Thus, A c£l a ®max ^B and £Iab Q ^A Cimax ^B- 

We show that the set of allowed local transformations 
T satisfies the consistency condition. As noticed above, 
the local state space remains invariant under 7”. By 


definition, A remains invariant under local transforma¬ 
tions from 7” too. Thus, from the definition of £Iab, we 
have Tuj G £Ia and T$ G 17 ab for all w G £Ia, G 17ab 
and T gT, and similarly for system B, as required. 

It is easy to see that local continuous reversibility is 
satisfied. Consider any pair of pure states ujrg,ujr^ G 

17m ■ There exist continuous reversible transforma¬ 
tions ^ G T such that Rtq = n (and = 

ro) for any /r G {0,1}'^, hence, Tjl^^LOrg = uiri and 

Tfj, OJri — ^vq ■ 

Similarly, any pair of pure entangled states 
4)^, G A are connected by reversible trans¬ 
formations: as 

follows from Eqs. (|C1I) . (IC2I) and Eq. (fT^ of the 
main text. However, these transformations do not act 
continuously on the states which is easily 

seen from the fact that the set Tn in (ICIII is discrete. 

On the other hand, tomographic locality is violated be¬ 
cause there exist states in 17^^, the entangled states in 
A, that cannot be determined from the outcome proba¬ 
bilities of local measurements performed on 17^ and 17 b ■ 
This is easily seen from (IC3I) because the outcome prob¬ 
abilities of local measurements on states G A are in¬ 
dependent of the state 4)^. Thus, the states in A cannot 
be determined from local measurements. 

We introduce dense coding protocols in the state space 
17AB defined above that are |hyperdense| Alice and Bob 
initially share the state $o given bv (IC2I) . With probabil¬ 
ity Px = 2“'^, Alice implements the local transformation 
defined by (ICll) for some R G SO (to), and the state 
transforms into for x G {0,1}^. Alice 

sends Bob her system. Bob applies the joint measure¬ 
ment defined by the effects 

Fy^2-^%=(ly (C4) 

for y G {0,1}'^, where in the second equality we used the 
definition (|C2ll . and expression ([8]) of the main text. 

We show that {Fy}y^^QiyN defines a measurement. 
First, we show that C Sab- Since 17 ^b = 

convex hull{17A Omin 17 b) A} with A = {4>a;}2;g{o,i}2V) we 
only need to show that 0 < Fy ■ {oja ® ujb) < 1 and 
0 < Fy • <i)a; < 1, for all x,y G {0,1}'^, Ua G SIa and 
ujb G SIb- It is easy to see that Fy ■ (uja ® ujb) = for 
all y G {0, l}'^, Wa G SIa and uib G SIb- Moreover, from 
(IC2I) . (|C4I) and Eq. (fTTl) of the main text, we have 

Fy * 4 * 3 , - Ey - (px - ^y,X^ (^ 3 ) 

for all x,y G {0,1}'^. Thus, {Fy}yg|oq}w C £ab- Sec¬ 
ond, from (IC4I) and Eq. m of the main text, we have 
X)ye{o i}'*' ~ ‘liag(l,0), with 0 being the null vector 

in R"'+'" and n = 2^ — 1, which is the unit effect uab on 
^ab- 

This protocol achieves a mutual information between 
Alice’s and Bob’s random variables X and Y of 

I{X ■.Y)=N, 


(C6) 
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as follows from (IC5|) . Since the classical capacity of the 
systems in VIa or fig is one bit for any value of m, this 
dense coding protocol is |hyperdense| for N > 2. 


if |A| < Ai, and there is not superdense coding if |A| < Aq. 
First, we use Lemma [5] stated in Appendix iBl to write an 
arbitrary effect E in the form 


Appendix D: Hypersphere theories violating local 
continuous reversibility that do not have hyperdense 
coding or superdense coding 

We give an example of a class of nontrivial theo¬ 
ries with local systems being HST systems that sat¬ 
isfy tomographic locality, violate local reversibility and 
[local continuous reversibilityl Thus, in general, the given 
theories violate continuous reversibility. We show that 
some of these theories do not have hyperdense coding! 
while others do not even have [superdense coding 

Consider a theory of two HST systems of dimension 
n = 2^ — 1 and N > 2, with a joint state space 
^AB = convex hull{0^ ®min The set of en¬ 
tangled states is defined by , where 

the states are given by Eq. m- That is, we have 

^ (o Af^) ’ 

where is defined as in Eq. d of the main text, for 
/i G {0,1}^ and some A G [—1,1] that we specify below. 

We note that these theories satisfy tomographic local¬ 
ity and violate local reversibility and local continuous 
reversibility, as the theories defined in Section iHll Since 
we have a discrete set of entangled states, and the sets of 
local pure states are hyperspheres, the consistency con¬ 
dition implies that not all transformations that connect 
the whole set of local pure states are allowed local trans¬ 
formations. Thus, the set of local pure states cannot 
be connected, which violates local reversibility and local 
continuous reversibility. 

Now we show that there are numbers Aq, Ai satisfying 
0 < Ao < Ai < 1 such that there is not hyperdense coding 


E = 7 


1 / 3 ^ 


(D2) 


where 0 < 7 < 1, a,/3 G M”, F G M” 0 M", ||a|| < 1, 
||/3|| < 1 , llbfcll < Ij with 7 fe being the fcth column vector 
of F, for = 1 , 2 ,..., n. 

Second, we note from the convexity of the mutual in¬ 
formation and the definition of ^ab that a dense 
coding protocol that achieves the [dense coding capacity! 
of the theory must use a pure entangled state (j)lt G 
as an initial state. This is because if the initial state is a 
convex combination of several states, being these entan¬ 


gled or product, the [dense coding capacity of the state 
can only decrease. 

Third, using dm]), jmi) and Proposition [TJ we bound 
the dense coding capacity of the theory by 


Xbc{^ab) < log 2 (l + max{Afj, ■ F}). 


(D3) 


Fourth, since the matrix is diagonal with entries 
either 1 or — 1 , and the diagonal entries of F are bounded 
by —1 and 1 , as follows from the bound || 7 fc|| < 1 in (|D 2 ]), 
we obtain from (ID3D that 

XDc(f^AB) <log 2 (l + |A|(2^-l)). (D4) 

Finally, if we set Xj = for j = 0,1, we ob¬ 

tain xpci^AB) < 2 if |A| < Ai, and xdc{^ab) < 1 if 
|A| < Aq. Thus, it follows from the definitions of hyper¬ 
dense coding and superdense coding, and the fact that 
the classical capacities of HSTs is 1 bit that for the class 
of theories given above, there cannot be hyperdense cod¬ 
ing if |A| < Ai, and there cannot be superdense coding if 
|A| < Ao, as claimed. 
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